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THE LITERATURE ON LANGUAGE ABILITY AND ITS RELATIONSHIP 
TO ACADEMIC SUCCESS INCREASINGLY VOICES ALARM THAT THE 
CULTURALLY DISADVANTAGED ARE SERIOUSLY DEFICIENT IN LANGUAGE 
ABILITY. INTERVENTION PROGRAMS CREATED TO CORRECT THIS 
PROBLEM ALL RECOGNIZE THE IMPORTANCE OF LANGUAGE TOOLS FOR 
INTELLECTUAL FUNCTIONING. ESSENTIAL TO SUCH PROGRAMS, AND 
FREQUENTLY ABSENT FROM THEM, ARE RELIABLE MEASUREMENT 
TECHNIQUES FOR EVALUATING THE INTERACTION OF THE PROGRAM WITH 
THE CHILD’S COGNITIVE PROCESSES. THE PRESCHOOL LANGUAGE 
PROJECT AT THE UNIVERSITY OF CALIFORNIA AT LOS ANGELES IS 
CONSTRUCTING AND USING NEW MEASURING DEVICES— FOR EXAMPLE, 

THE VISUAL DISCRIMINATION INVENTORY <VDI) . THE VDI WAS 
ADMINISTERED TO 291 PRESCHOOL CHILDREN TO OBTAIN DATA ON THE 
VISUAL DISCRIMINATION ABILITY OF THE 199 NEGRO AND 92 
CAUCASIAN CHILDREN. THE CHILDREN REPRESENTED TWO LEVELS OF 
ECONOMIC STATUS AND RANGED IN AGE FROM 3 TO ALMOST 6. 
ADMINISTRATION OF THE VDI INVOLVED PRESENTING THE CHILD WITH 
A MODEL FIGURE AND THREE CHOICE FIGURES FROM WHICH TO SELECT 
THE ONE THAT MATCHED THE MODEL. AGE AND RACE APPEAR TO BE 
RELATED TO DISCRIMINATION ABILITY. THE VDI WAS FOUND TO HAVE 
BOTH RELIABILITY AND VALIDITY. THIS DOCUMENT IS COMPOSED OF 
TWO REPORTS. ONE WAS PRESENTED AT THE BIENNIAL MEETING OF THE 
SOCIETY FOR RESEARCH IN CHILD DEVELOPMENT <NEW YORK, APRIL, 
AND THE OTHER, AT THE AMERICAN PSYCHOLOGICAL 
ASSOCIATION ANNUAL MEETING {WASHINGTON, SEPTEMBER, 1967). 
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Evaluating Language Curricula for Preschool 

1 2 

Disadvantaged Children 
Carolyn Stern 

University of California, Los Angeles 

There are some linguists who maintain that children from culturally-dis- 
advantaged homes are not, per se, language handicapped; that they have a func- 
tionally complete, although perhaps different, language system of their own. 
Whether or not this Is true from a linguistic point of view Is relatively Imma- 
terial to educators who are faced with the high Incidence of school failure 
among these "linguistically-different" children. There Is no doubt that In 
almost every measure of verbal ability, visual and auditory discrimination, 
and other cognitive and perceptual skills closely related to successful per- 
formance In academic tasks, children from the "culture of poverty" score signif- 
icantly below those from middle-class homes. 

Herbert Birch (1967) called attention to the danger of drawing Inferences 
of causality from correlational data. Because a set of variables can be shown 
to covary in a dependable relationship under a given set of circumstances does 
not necessarily mean that the modification of one will produce a predictable 
change in the other. Edward Gordon (1965) has pointed out that correlational 
studies offer no guidelines for planning interference procedures and may be 
harmful in that they provide a spurious statistical basis for a popular myth- 
ology. However* the evidence from psychological studies convincingly demonstr- 
ates that the ability to perform logical operations, reason inferential ly, and 
extract generalizations from the abstract manipulation of experience is Intrinsi- 
cally tied to the mediational function of language. 
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The consistent findings, from experiments throughout this country as well 
in other countries (e.g. Israel), that language deficits characterize child- 



ren from low-income homes, and that the deficits are greatest in those uses of 



language most closely related to cognitive behavior, have led to what amounts 
to a revolution in early childhood education. Five or ten years ago, the nur- 
sery school teacher would have been highly resistant to the inclusion of system- 
atic language development in her curriculum. She was much more concerned with 
block play, arts and crafts, and music and rhythms, all of which activities were 
used to provide opportunities for developing that ultimate desideratum: social- 

ization. Contrast this with a recent survey in which 119 Headstart and Day 
Care teachers selected additional course work in "developing language skills" 
most frequently as either the first, second, or third choice from a list of 
25 possible training courses (Gordon, 1966). Contrast this also with the rash 
of language-based compensatory preschool programs, ranging from the complex 
capacities of the quarter-of-a-mill ion-doll ar Edison Responsive Environment 
(more popularly known as the "talking typewriter") to equally revolutionary 
procedures with less expensive materials and equipment. 

These intervention programs may be conceived as lying along a continuum 
defined in terms of structure* The Bereiter-Engelmann (1966) program described 
by Jean Osborn (1967) would be at one end of the distribution, that of Pat 
Minuchin and Barbara Biber (1967) at the other. In the first, the position is 
that language deficiences can best be compensated by a systematically-struc- 
tured program which emphasizes patterned repetition and drill. Based on "logi- 
cal rather than linguistic analogies," it conceives language as the "vehicle 
for teaching of concepts and the rules for manipulation of concepts" and does 
not concern itself with the "social and expressive uses" of language. 



The developmental approach sees language as but one aspect of the child's 
growth pattern. Because it is considered to be dependent upon many non-verbal 
factors, with important emotional and psychological components, the language 
program is placed within the total experiential context. 

Between these two positions are the approaches used by Susan Gray (Gray 
and Klaus, 1966) at Peabody, David Weikart (1966) at Ypsilanti, Walter Hodges 
(Hodges, Stearns, and Spicker, 1966) at Indiana, and Allan Hartman (1966) in 
Harrisburg, Pennsylvania, to name but a few. The materials used by Larry Gotkin 
(1965, 1967) are part of a comprehensive and many- faceted program which the 
Institute for Developmental Studies is carrying out in New York City. Finally, 
Project Headstart has put preschool intervention on a national basis. 

Throughout all of these programs, in some more explicitly than others, 
there is an awareness that language is the key to unlock the child's ability to 
learn. The more complex and varied the language tools the child is capable of 
manipulating, the greater will be his control over those basic cognitive pro- 
cesses which are prerequisite to the type of intellectual functioning which 
leads to school success. (Cf. Bernstein, 1964; Carroll, 1964; John and Goldstein, 
1964; Barbara Gordon, 1965, to select a few of the earlier articles from what 



has now become a voluminous literature.) 

There is usually some provision for measuring the effectiveness of the be 



havioral changes which are presumed to occur as a result of a particular program. 
In most cases the instruments used are selected from existing standardized tests. 
Among the most frequently used for such pre- and posttests are the Stanford-Binet, 
Peabody Picture Vocabulary, Goodenough Draw-A-Man, and the Illinois Test of Psy- 
cholinguists Ability. The Institute for Developmental Studies has undertaken 
the construction of new instruments for use with disadvantaged children. The 








Reading Prognosis Test (Feldmann and Weiner* 1962), the Standard Telephone Inter- 
view (Gotkin, et al., 1964), the Story Re-tel ling Technique (John and Berney, 
1967) and the Children's Language Sample (Scher and Horner, 1967), are attempts 
in one way or another to evaluate children's language in naturalistic settings, 
thus avoiding the criticisms (cf. Salzinger, 1967) that ratings based on single- 
word vocabulary tests do not present a meaningful picture of verbal facility. 
Unfortunately, however, there is no simple, objective procedure for scoring or 
categorizing the tremendous corpus of children's language which can be quickly 
and economically collected by these techniques. The hundreds of hours of lis- 
tening, by trained phoneticians, to many repetitions of the taped interviews, 
the necessity to obtaip some degree of consensus from several judges for each 
speech sample, and the difficulties in rating the transcribed utterances, make 
these procedures inappropriate for a testing instrument to be used with a large 

population. 

Basically, the problem of evaluating any specific program is often con- 
founded by a lack of explicitly stated behavioral objectives. In some cases 
where the objectives are explicated, there is a lack of correspondence between 
the stated goals and the instruments used to evaluate the terminal behavior. 

It is obvious that evaluation instruments must be designed so as to measure the 
effectiveness of a particular procedure in accomplishing what it sets out as 
its goals. However, there is the far more fundamental problem of determining 
the rationale which sets these immediate -objectives within the framework of the 
long-range goals of intervention and innovation in education. 

Probably the most prevalent, and most often left implicit, goal of a lang- 
uage remediation program is that of imposing middle-class speech patterns, even 
if only as a "second language," on children who use dialect speech. Here the 
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major emphasis is on the social values* of “good" English. The argument for this 
objective is that standard English is an essential ingredient in effective cog- 
nitive functioning. 

Closely related to both of these purposes is that of raising scores on 
intelligence tests. While the correlation between intelligence and academic 
success is well established, the meaningful ness of I.Q. scores for this popu- 
lation when obtained with existing instruments is open to question. Glide (1966) 



has pointed out the fallacy of first interpreting performance on intelligence* 
tests as reflecting underlying cognitive structures, and then inferring from 
improvement in scores on these tests that fundamental changes in cognitive str- 
uctures have occurred as the result of the intervention activity. The implied 
distinction between performance and innate ability is highly reminiscent of 
the 3 performance-competence controversy which has raised the blood pressure and 
temperature of linguists and psycholinguists. For the product-oriented re- 
searcher who is intent on producing specific and measurable behavioral changes 



in disadvantaged children, the fine points of this argument are of little prac- 
tical value. Even if all an intervention program does is teach these children 
how to "play the game" and motivate them so that they do become successful com- 
petitors in the academic arena, the program has had an obvious and important 
beneficial effect. 

♦ 

Unfortunately, it is especially difficult to estaUish that a particular 
type of instructional sequence has achieved the objective of preparing a child 
for continuing success in school tasks. This type of evidence can only be 
gained from longitudinal studies in which appropriate curricula are presented 
over several years of schooling. It is patently absurd to imply that a compen- 
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satory program has been a failure because initial gains are dissipated when the 
child is placed in an inadequate primary curriculum. The position of this 
paper is that any number of approaches to language development are permissible 
and valid as long as the stated outcomes of the program can be demonstrated in 
overt and measurable behavior. 

At UCLA, the Preschool Language Project has accepted as expedient the 
objectives of teaching children to use standard English both as a tool with 
which to control their environment more effectively, and as a conceptual system 
with which to process information, perceive relationships, and perform logical 
operations. There is the collateral expectation that I.Q. scores will show 

increases, along with improvement in school performance, maintained through 
the first grade. 

The program provides for two types of evaluation: summative and form- 
ative (Scriven, 1965). The former looks primarily at the terminal behavior, 
the latter is the ongoing process which tests each phase of the instructional 
program while it is being developed. For the first type of evaluation, two 
traditional instruments are being used, the Peabody Picture Vocabulary Test 

and the Goodenough Draw-A-Man Test. In addition, several new instruments are 
being constructed. 

The first of these is a measure of auditory discrimination which avoids a 
critical weakness in existing instruments of this type. In many tests, the 
child's ability to discriminate sounds Is measured in terms of how well he can 
classify pairs of auditory stimuli as either the "same" or "different." Cult- 
urally-disadvantaged preschool children find this task very difficult. A test 
which involves the comparator function will have low validity insofar as scores 
reflect the child's level of ability to understand the instructions rather than 
to discriminate the stimuli. 
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The Children's Auditory Discrimination Inventory (CADI) uses 38 pairs of 
pictures. One of each pair represents a familiar object, called by its appro- 
priate label; the other is a nonsense picture to which a nonsense disyllable 
has been arbitrarily assigned. The child is told the names of the pictures by 
the examiner, and then given the stimulus to which he is to respond. For ex- 
ample the sample item in Figure 1, "This is a table and this is a pable. Put 

I M U I MC 

your finger on the pable." On the whole, an equal number of real and nonsense 
words are called for, but in random order. 

The words selected were based on a hierarchy of difficulty of phonemic 
contrasts, ranging from exceedingly gross discriminations (e.g. girl - hujuj) 
to minimal pairs (e.g. fish - fith). The real words and pictures have been 
pretested with a comparable population and found to be well within the child- 
ren's response repertoire. The nonsense stimuli were selected from those 
rated low in association value in a study with a similar population. 

A second instrument, the Visual Discrimination Inventory (VDI), is con- 
cerned with the assessment of the child's ability to discriminate visual stim- 
uli. Most available tests of this skill require the child to produce a written 
response which usually demands a high order of motor control. Again, the mea- 
sure of discrimination is confounded with the irrelevant characteristics of the 
response mode. To provide a more valid test of discrimination, the VDI re- 
quires the child to indicate his ability to discriminate forms by making a 
simple selection response. The four areas under which the tasks are subsumed 
are: form constancy, figure-ground, closure, and position-in-space. (See 
Figure 2) There are 52 items in the test, which takes approximately ten minutes. 
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Before the test is administered, a preliminary training sequence pro- 
vides familiarization with the selection task. The children are taught to 
match one of three pictures with a model, using familiar objects such as a 
cat, a ball, and a bat. Only after ten correct selections in a row have been 
made is the child given the actual test items. 

These two tests are included in our evaluation battery because there 
is a pervasive belief, based on what may be- termed a commonsense validity, 
that the way children discriminate sounds and forms is closely related to the 
acquisition of reading skills. While there is ample correlational evidence 
that good readers are more apt to be good discriminators and poor readers poor 
discriminators, there have been no successful experiments to demonstrate that 
improving performance in discrimination will also improve performance in read- 
ing. After sufficient normative data have been gathered, and the reliability 
of the instruments established, a number of experiments will be designed to 
test the hypothetical relationship between beginning reading and auditory and 
visual discrimination. 

The instrumental use of language is most clearly demonstrated in its ex- 
pressive function. At the simplest level of complexity is echoic responding or 
Imitative behavior. A mynah bird or a parrot is capable of this type of pro- 
duction. However, it can be assumed that the ease with which an utterance can 
be produced is a function of the degree of familiarity the responder has with 
the particular chain of verbal stimuli. Thus it may be much easier for an 
adult to repeat a long sentence in his native language than a short phrase* in an 
unfamiliar one. To obtain a measure of the child's range of sentence complex- 
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ity, the Echoic Response Inventory for Children (ERIC) presents a sequence of 
20 sentences,, differing in length and transformational difficulty. The first 
few sentences are short, such as "Dogs bark." or "Babies drink milk." A more 
difficult sentence would be "If it's late I have to hurry." A sheet with all 
the sentences written out is marked by the examiner to indicate any deviations 
from the stimulus sentence. (See Table 1 for sample sentences.) This is kept 

I N MC T 5 B PIC A M T I H I R I 

as a record of the child's performance. It can be compared with his perfor- 
mance at a later date to obtain an objective measure of change in behavior 
over time, with and without intervention procedures. 

While there has been increasing criticism of the use of vocabulary as an 
index of verbal facility, the fact remains that performance on vocabulary tests 
is still used as the major criterion for selection for college (College Entr- 
ance Examination) and even graduate school (Graduate Record Examination). 

The Peabody Picture Vocabulary has been widely used in evaluating the effects 
of compensatory preschool programs. However, it covers a wide range of ,«ental 
ability, and perforce provides only a small sample of words for the young child. 
Also, it does not require the child to produce the word, only to select, out of 
four pictures, the one with which it is associated. The Expressive Vocabulary 
Inventory (EVI), developed for the Preschool Language Program battery, differs 
in both these respects. First, it includes 40 items, selected from a list of 
words obtained in a field test to determine the vocabulary children are expected 
to possess when they enter kindergarten. By providing a larger number of suit- 
able items, it should be a more reliable Instrument for this age group. Secondly, 
the response required from the child is not a pointing or selection response but 
a verbal one. Whereas most vocabulary tests are heavily weighted with nouns. 
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the stimuli in the EVI represent a variety of parts of speech. There are pro- 
gressive verb forms, prepositions, adjectives, and adverbs, as well as verbs 
and nouns. For instance, the child is shown a picture of a boy swimming, and 
is asked, "What's the boy doing?" Credit is given only if the -ing form is used. 
Another picture shows a cat in a box, with the question: "Where's the cat?" 

The child gets credit only if he uses the appropriate preposition. (See Table 
2 for sample items.) 

MEfrlTmxAMOM 

Two additional tests are being developed to get at larger samples of 
childrens' speech. The first is a semi -structured Verbal Output Inventory. In 
this instrument, children are shown five black-and-white line drawings, one at 
a time. There are two country scenes, a middle-class urban scene, an urban, 
slum street scene, and a picture of a typical zoo. For each scene, the child- 
ren are given a five second interval to simply look at the picture. Then 
they are instructed to name as many things as they can. When they have stopped 
labeling, they are asked to tell what is happening in the picture. Finally, 
they are encouraged to imagine what is going to happen next. (See Figure 3 
for sample items.) 

BMOIgM I MC1SPI 

The second language sample is obtained from a Structured Story-Telling 
Test which requires the child to produce his own story on the basis of pictorial 
stimuli. For the first story, the child is shown two separate black-and-white 
line drawings. In the first picture, a boy is flying a kite. In the second 
picture, the same boy is standing and watching the kite flying away. The child 
is asked to look at both pictures (the examiner indicates by pointing to each 
picture consecutively) and to tell a story about them. The second story has 
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three separate pictorial stimuli which tell a similarly simple and familiar 
story. The scoring system for this task is still in the process of revision, 
but there will undoubtedly* be some credit assigned for recognition of story 
continuity. 

Another series of tests, the Language Comprehension Inventory, attempts 
to evaluate the child’s ability to respond appropriately to the verbalizations 
of others. A subtest aimed at measuring ability to respond to prepositions 
requires the child to place a checker in the appropriate relationship to a 
small box (in, on, behind, etc.) or two boxes (between). Another subtest pro- 
vides the child with a booklet, each page of which consists of a model and 
two or three alternatives. The child is asked to look at the model and then 

mark the picture which is exactly like it. 

Verbal mediation may well be considered the highest level of language 
usage. The way children use their own language to help them solve problems, 
as well as their ability to cope with those simple logical operations control- 
led by sentential connectives such as "both - and", "either - or", and "not" 
are tested in a series of subtests involving conjunction, disjunction, and 
negation. Children are also tested on their ability to draw inferences and 
solve problems on the basis of data which they are given. 

This battery of evaluation instruments is designed to assess the child's 
ability to use language in expressive* receptive, and mediational tasks. De- 
signed as pre- and posttests for the Preschool Language Program, they are not 
intended specifically for this particular language-training approach. None of 
the items appear in any of the instructional programs; however, it is hoped 
that there will be generalization and transfer from the training to the test 
items. As Shulman (1966) has pointed out, while the insistence on behavioral 
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statements of objectives forces educators to think with extreme precision about 
the hoped-for outcomes, there is a danger that this emphasis on specifying be- 
havior will restrict the scope of the instructional objectives. Courses thus 
become training in test- taking behavior. It should not be forgotten that the 
basic objective of instruction is to produce transfer to new situations and new 
tasks in which the trained skills are required. 

Another approach to evaluation is being explored under a separate grant 
from the Center for the Study of Evaluation of Instructional Programs at UCLA. 
From this point of view, it would be important to measure the rate or effic- 
iency of learning under different instructional procedures. All children will 
be given several days of instruction with the same materials, but using differ- 
ent pedagogical techniques. However, if the content is drawn from the universe 
of knowledge about the real world, children from different homes will have dif- 
ferent types of previous experience. It has long been recognized in learning 
experiments with older subjects that association value, or meaningful ness, is 
an important confounding variable. The fact that certain learning occurs more 
readily under certain conditions, or with certain subjects, may be more closely 
related to the characteristics of the materials than to the nature of the instr- 
uctor or the learners. To provide the controlled materials for a variety -of 
learning experiments with young children, a repertoire of nonsense syllables- and 
nonsense pictures of several measured levels of meaningful ness are being pre- 
pared and tested with a socio-economically representative population of child- 

. I 

ren between the ages of four and six years. 

There is no doubt that all these evaluation instruments neglect a number 
of affective variables which have a potent impact on the rate and nature of 
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learning. Among the most significant of these are impulsivity, motivation, and 
attention. Middle class children have already learned to value the praise and 
approval of adults. The effectiveness of these secondary reinforcers in con- 
trolling the behavior of children from low-income homes is not equally well 
established. Part of the task of an intervention program is to teach these 
children to obtain satisfaction frim achievement in academic tasks both from 
their own sense of increased competence as well as from the contingent approval 
of adults (Temp, 1967). It is quite possible that many of the gains in I.Q. 
scores reported in evaluations of intervention programs are more a reflection 
of the fact that the child has learned to respond more appropriately to the 
examiner's expectations, as a result of reinforcements within the school sit- 
uation, than anv improvement in ability to perform tne various tasks (Glick, 
1966). 

A major source of difficulty in evaluating instructional programs is that 
the instruments used are designed to measure individual differences rather than 
program effects on groups (Trismen, 1967). The use of a large pool of test 
items, as advocated by Cronbach (195?), would make it possible to construct a 
number of different tests to be administered to groups of children who have re- 
ceived a particular instructional procedure. For this item pool it would be 
quite appropriate to include at random items selected from the instructional 
material, as well as items over new material. A test of the oujectives of both 
learning and transfer could thus be provided. 

This procedure is particularly useful with young children who cannot sit 
through lengthy test periods. The test would not oe a useful measure of the 
achievement of individual students, since each child would be tested only on 
one or two items over a wide variety of skills. Nor would they be useful for 
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comparing performance of children with others in the class, since each child 
in a particular class might be giver, a different set of items. However, a 
reliable basis for evaluating the total effect of the instructional program in 
achieving its stated objectives could be obtained by gathering data at random 
over a large number of children. 

In closing, it might be appropriate to mention one of the most sticky of 
all evaluation problems, and one which most program evaluations tend to avoid. 
That is, how worthwhile are the goals which are being evaluated? Even if it 
can be proven beyond doubt that specific behavioral objectives have been achie- 
ved, is the learned behavior really important or relevant? Does improving the 
child's ability to discriminate environmental sounds and distinguish differ- 
ences and similarities in geometric or. pictorial visual forms actually lay a 
solid foundation for learning to read? Does exposing disadvantaged children 
to a variety of experiences, so that they can produce coherent stories about 
them in a limited, dialectical form, mean that these children can then use 
language to conceptualize, classify, and form schemata with which to integrate 
the data in the real world? 

More fundamentally, most preschool intervention programs are designed to 
prepare disadvantaged children for entrance into the traditional middle-class 
kindergarten; in kindergarten, children are being prepared for first grade; 
firstgraders for second, and so on down the line. Preschool programs evaluated 
on this type of criteria are successful if they demonstrate that children who 
complete the program are therefore more likely to succeed in kindergarten, and 
even in first grade. But is this the ultimate goal? It is true that the UCLA 
Preschool Language Program has accepted "as expedient" the objective of success 
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in kindergarten and first grade. It is equally important to develop instru- 
ments to evaluate how well a particular curriculum achieves its stated goals. 
But this does not minimize the Importance of at some point attempting to eval 
uate the goals themselves. 



Footnotes 

^ Paper presented at Society for Research in Chi 1 d Development, Biennial 
Meeting, New York, April, 1967. 

^The major part of the research reported in this paper was performed under 
contract with the United States Office of Education, Department of Health, 
Education, and Welfare, Cooperative Research Program, Contract Nunber 
0E-5-85-045, and by the United States Office of Economic Opportunity, 
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table 1. Sample Items from Echoic Response Inventory for Children. 



Form A 

1. Dogs bark. 

5, He pushed the door too hard. 

10. Quite a good show is playing. 

15. If it's late, I have to hurry. 

20. If the ground is wet the children 
won't be able to play in the park. 



Form B 

1. Birds fly. 

5. He bounced the ball too high. 
10. Quite a large dog is barking. 
15. If it's late, he has to run. 
20. If the weather is cold the 
children won't be able to, swim 
at the beach. 



Table 2. Sample Items from Expressive Vocabulary Inventory. 



It A., 

No. 


Description 
of Picture 


Question 


Scoring 
Key Word 


1 . 


fish 


"What is this?" 


"fish" 


5. 


boy swimming 


"What's the boy doing?" 


"swimning 


9. 


cat in a box 


"Where's the cat?" 


"in" 


33. 


large and small ball 


"This ball is smaller. 


"larger" 


* 

38. 


circle and square 


what is this ball?" 

"This is round. What is this?" 


"square" 
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Fiqure 1. Samole Item from Children's Auditory Discrimination Inventory. 
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Figure 3.- Sample Stimuli: Verbal Outo&t Inventory for Children, 
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AN INSTRUMENT TO MEASURE VISUAL DISCRIMINATION OF YOUNG CHILDREN 1 ’ 2 

Carolyn Stern and Avima Lombard 
University of California, Los Angeles 
A strong prima-facie case can be made for the relationship between 
visual discrimination ability and success in reading. The obvious need 
for the reader to make consistent and dependable responses to specific 
visual stimuli (graphemes) provides a persuasive logic for this widely 

held assumption. While reading, because of its basic role in all aca- 

* 

demic learning, is probably the most extensively studied subject in the 
curriculum, the literature is notable for the ease with which sweeping 
generalizations are propounded and accepted, with little support from 
experimental research. It is also notable for the large numbers of re- 
ports and articles deceptively decked out in the mini-skirt of experimental 
design. 

William Gray has authored a* series of reviews of research relating to 
reading dating back to 1925. He makes the statement that the accurate recog- 
nition of words involves a number of perceptual processes. Gertrude Hildreth 
(1950), an equally reputable reading specialist, recommended informal training 
in visual perception skills, with emphasis on matching and "looking", but on 
an experiential level rather than in a structured or formal workbook context. 



Hhis work is being supported by the United States Office of Education, 
Project Number 0E-5-85-045, and the United States Office of Economic Oppor- 
tunity, Project Number IED 66-1-12. 

2 

A preliminary version of this research was presented at the American 
Psychological Association Annual Meeting, Washington D.C., September, 1967. 
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This generalized perceptual training approach is supported by Anderson and 
Dearborn (1952) and Bond (1960). More recently. Arena (1965) recommended 
training in visualization and motor skills as remediation for “more subtle 
language problems." Delacato (1963) bases his entire system of reading 
instruction on the early acquisition of perceptual -motor skills. 

In recent years, the emphasis on conducting learning research In the 
school setting has stimulated a great many studies, a large proportion of 
which have little claim to objective validity. Even experiments which main- 
tain careful controls, however, produce conflicting results. Hillerich (1964) 
found that formal workbook training in pre-reading skills in kindergarten pro- 
duced better readers at the end of the first grade, whereas Goins ,* 

Ploghoft (1959), Gorelick (1962), arid Fry (196$) found that children who had 
used readiness workbooks during kindergarten did not have greater success In 
beginning reading than children who had had unstructured readiness experiences. 

Many pre-kindergarten programs today are placing great emphasis on "Same- 
different" discrimination training in both auditory and visual modalities, 
although McKee (1948) had found that such exercises were not useful in devel- 
oping word recognition. Instead, he demonstrated that the best way to Improve 
word recognition was to provide drill in word recognition. Both King (1963) 
and Popp (1964) have obtained results which seem to support the position that 
training in discriminating letters and words does improve the ability to make 



such discriminations within a reading context. 

While Dunn (1965) maintains that visual perception is more Important than 
vocabulary in predicting ease of learning to read, Ashlock (1963) could find 
no support for a relationship between visual perception and reading In terms 
of either the task or the content of the training materials. However, it was 
found that the importance of visual perception skills in predicting levels of 
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reading achievement decreases with age. This may very well reflect the age- 
related changes In visual perception reported by Elkind, Koegler, and Go 
(1964), and Bralne (1965.) 

Although a strong correlation between reading and Intelligence Is well*" 
established, the relationship between visual perception and Intelligence Is 
still unclear. Allen, Haupt, and Jones (1965) found low correlations between 
visual perception and the spatial -visual subtests of the WISC. Birch and 
Belmont (1964, 1965) noted that whereas the correlation between I.Q. and 
reading Increased with age, this was not true with tests of auditory-visual 
integration. These investigators speculated that this may be due to the low 
ceiling of the auditory-visual integration test, or that the perceptual, skills 
might be most Important for initial acquisition whereas later reading emphasizes 
comprehension, which is more closely related to intelligence. Still another 
interpretation may be drawn from the work of Abravanel (1966) which Indicated 
that performance on a "same-different" task did not approach asymptote level 
until about five years of age. 

Gibson and 01 urn (1960) report a finding by Hemmendinger that young chil- 
dren respond globally to Rorschach ink blots at the age of three, increase 
their attention to details at about six or seven, and integrate parts into 
wholes at the age of nine or ten years. Gibson and 01 urn take the position that 
the perceptual factor in learning to read is closely related to the ability to 
discriminate meaningless or abstract forms. This type of reasoning provides 
the rationale for extensive use of matching exercises in "reading readiness" 

workbooks . 

A major and critical problem in most studies of perceptual discrimination 
in young children is the inadequacy of criteria or instruments for measuring 
this ability. The most commonly used tests are Bender (1938), also Koppltz, 
(1964) , Pros tig (1964), and Winterhaven (1966).. All of these, however, con- 
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found eye-hand coordination with perceptual discrimination skill. A new test 
developed by Rosenberg (1966) avoids this criticism by employing a pointing 
or selection response, but the highly structured and abstract stimuli have 
little intrinsic interest for the young child. In addition, there is no 
attempt to get at different types of discrimination tasks. 

The test most frequently used for the assessment of visual discrimina- 
tion with young children is the Frostig Developmental Test of Visual Percep- 
tion. It is based on a delineation of five hypothetically distinct areas: 
eye-hand coordination, figure-ground, form-constancy , position-in-space, and 
spatial relationships. Age norms for children between the ages of three and 
nine are available. The use of this instrument with children demonstrating 
dyslexia as well as other learning disorders has yielded dependable correla- 
tions. 

On the basis of this relationship (a '‘ post-hoc , propter hoc" fallacy) 
a number of studies have investigated the value of using a training program 
especially prepared by the Frostig school for the remediation of perceptual 
deficits. Cohen (1966) and Arciszewski (1967) used these materials in train- 
ing programs and found improved performance on the Frostig test, but no im- 
provement in reading. Olson (1966) found a significant correlation between 
the Frostig training materials and the vocabulary subtest of the California 
Achievement Test, but there was no significant relationship with the Gates 
Word Recognition Test. Olson found only one dependable correlation, that 
between position-in-space and reversible-words-in-context (r * .386). 

In the studies cited, where the children are in the elementary grades and 
can perform the motor tasks on which the tests are based, there seems to be no 
problem with the existing measures of visual discrimination. However, at this 
level the best experimental evidence indicates that providing remedial discrimi 
nation training for children who obtain low scores on such tests is not the 



most efficient method of producing improvement in reading skills. If the 
goal is increasing the child's ability to discriminate letter and word 
forms and to associate auditory responses with visual stimuli, it is far 
more effective to provide direct practice with the relevant materials. 

However, for the very young child, especially those from disadvantaged 
homes where opportunities to use paper-and-pencil are exceedingly limited, 
the available instruments for the measurement of visual discrimination are 
not appropriate. This statement is supported by data obtained by the authors, 
using the Frostig with a group of 17 four-year-old children, eight from a 
low socioeconomic group and nine from a middle socioeconomic group. For the 
first group, no child scored above the 20th percentile according to the 
Frostig norms. All the highly advantaged nursery school children, who had had 
considerable experience in similar tasks, scored below the 58th percentile. 
Three features of the test seemed to be most important in producing these 
results. 1) The language in which the tasks are presented tended to confuse 
rather than instruct, 2) The test took, on the average, about 40 minutes to 
administer and most of the children lost interest about half way through. It 
is of course possible to avoid this difficulty by administering the test in 
two parts on separate occasions, but this would not be consonant with the 
instructions in the test manual. 3) The use of different colors for marking 
was not only confusing, but also served to distract children from the task 
.since there was a general tendency to use the colors in drawing pictures. 

If there is predictive as well as face validity to the hypothesis that 
visual discrimination skills are important prerequisites to beginning reading, 
a way of measuring this skill in young children, without the confounding var- 
iables of motor skill, task comprehension, and experience with paper-and-pencil 
activities, must be developed. The present paper is a report of the construc- 
tion of one such instrument, as a first step in a more comprehensive study of 
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variables affecting children's ability to learn to read. 

Method 

Rationale and Description of Test 

The UCLA Discrimination Inventory (VDI) consists of 52 items in four 
subtests: figure-ground, form-constancy, closure, and position-in-space. 

These subtests are designed to measure separate aspects of visual discrimi- 
nation, each of which has been assumed to represent prerequisite skills 
important in learning to read. For instance, figure-ground may relate to 
the ability of a child to recognize familiar letters in new words; form- 
constancy seems to be prerequisite to identifying letters presented in dif- 
ferent sizes, such as upper and lower case; position-in-space may be involved 
when children read "b" as "d" or “p“ as "q"; and closure in the process of 
integrating letters into words. It is important to remember that these are 
hypothetical assumptions which must be subjected to empirical exploration. 

The test can be presented to groups of children, takes approximately 
15 minutes to administer, and very little time to score. 

The test items consist of 8 1/2“ x 11" frames (sheets of paper encased 
in plastic page protectors) with black-and-white drawings. (See Figure 1 
for sample items.) Each frame contains a model in the center of the top 



Insert Figure 1 about here 



half of the page and alternatives in three boxes across the lower half of 
the page. The task for the child is to select the one of the three alter- 
natives which is most like the model. 

Test Forms 



In the preliminary work, there seemed to be a tendency for children to 
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select the middle picture when the discriminations became too difficult. 

To estimate the effect of this position bias, as well as that of right-left 
preference, three forms of the test were constructed. In each form, the 
correct answer was in a different box. Thus, if in Form A the correct pic- 
ture was the left alternative, in Form B it might be in the middle box, and 
in Form C the right-hand box. The three forms were randomly presented over 
the total population. 

Subjects 

All the children, (291) between the ages of 3-0 and 5-11 in seven Day 
Care Centers and four private nursery schools were tested. There were 139 
boys and 152 girls, including 199 Negro and 92 Caucasian children, from two 
levels of economic status. Two measures of mental ability, the Peabody 
Picture Vocabulary and the Goodenough Draw-A-Man Test, were administered. 
Procedure 

All tests were individually administered in a small anteroom, if avail- 
able, or in a screened corner of the regular classroom. The child was asked 
to point to one of three pictures which corresponded to a model. However, 
when the first item with which the child was confronted showed the model on 
the same page as the alternatives, many children pointed to this picture when 
asked to find "another one" like it rather than to the one of three alter* 
natives which was most like the model. Task training was therefore necessary. 
It began by presenting the model on a separate card and gradually moving it 
into position in the center of the top half of the frame. If the child did 
not respond correctly to six consecutive items, the fcask-traintng program was 
repeated. The VDI was administered to children only after they had reached 
criterion on the basic task instructions. 
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Table 1 presents the mean scores on the total 52 item UCLA Visual 



Insert Table 1 about here 



Discrimination Inventory for 291 children, by sex, race, two levels of 
socioeconomic status, and three levels of age (3, 4, and 5 years). Since 
the test involves a three-choice selection, a score of 17 can be obtained 
by chance. Even the lowest mean score reported is significantly above 
this level. 

The uneven distribution of subjects within the 24 cells determined 
by the 2 x 2 x 2 x 3 design would have provided too few cases in several 
of these cells; therefore two separate 2x2x3 analyses of variance were 
computed. A principal objective of the first analysis (Table 2) was to 



Insert Table 2 about here 



determine whether any significant difference in visual discrimination could 
be attributed to sex, within age and race groupings. An F of .2 with almost 
300 cases provided assurance that, contrary to a number of studies which 
report important sex differences, there was little likelihood that boys would 
differ significantly from girls in their performance on this type of task. 
Whatever the factors that produce more beginning reading problems in boys than 
girls, they are probably not integrally related to the ability to make appro- 
priate visual discriminations. In the same analysis, significant differences 
(.01 level) were found for both age and race, although no interaction effects 
were dependabl e . 
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The second analysis of variance (Table 3) was concerned with the 



Insert Table 3 about here 



main effect of socioeconomic status, again within age and race groupings. 
While the T of 3.4 approaches significance, it is not sufficient to dis- 
prove the null hypothesis that differences in visual discrimination are 

* 

% 

not related to socioeconomic status. However, the main effects of race 
and age were verified, with again no interaction effects supported. A 
Newman-Keuls analysis shows that main effect differences are primarily 
attributable to the lower scores of the three-year-old and Negro children 
in the lower SES groups. 

Of the 291 children, 161 were given the Peabody Picture Vocabulary 
and the Goodenough Draw-a-Man Test. The relationship among these measures 
and the VDI, as well as the relevant means and standard deviations, are 
presented in Table 4. 



Insert Table 4 about here 



As indicated earlier, three forms of the VDI were used to control for 
the effects of position preference. Intra-form reliability for the four 
subtests and the total test are presented in Table 5. Using Horst's re- 



insert Table 5 about here 



vision of the Kuder-Richardson Formula 20, total test reliabilities of .90 to 
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.92 were found for the three forms. When data for items across forms are 
summed, the K-R reliability coefficient is .88. The Spearman-Brown pro- 
phesy formula for the same data produces a reliability estimate of .91. 

Discussion 

Although the design of the present investigation anticipated differ- 
ences in performance due to sex, age, race, and socioeconomic status, the 
data presented in Tables I through 3 indicate that primary differences are 
attributable only to age and race. According to a Newman-Keuls analysis, 
the performance of all three-year-old children does not differ significantly 
as a function of sex, race, or SES. Also, within these groups, particularly 
with Caucasian children of either high or low socioeconomic status, perform- 
ance at ages four and five are not significantly different; it is between the 
ages of three and four that major changes seem to take place. This is in 
line with other studies which demonstrate that visual discrimination skills 
attain almost maximum development by the 5th to 7th years. 

With Negro children, high SES groups show gradual growth from the third 
to the fifth year; the low SES children have very meager gains and at the 
fifth year are still not significantly superior to the average three-year-old 
Caucasian child. This evidence provides additional support for the "cumula- 
tive deficit" which characterizes disadvantaged children in so many areas 
related to academic performance. If this difference is found between nursery 
and day care populations, where there is a considerable degree of overlap, it 
suggests that more severely disadvantaged children will demonstrate even greater 
deficits. It is highly likely that an analysis including data from Headstart 
classes would show significant main effects for socioeconomic status. 




A number of studies have indicated that there is a positive, albeit 
weak, relationship between intelligence and visual discrimination. In 
Table 4 the correlations of the VDI with the Peabody and the Goodenough 
support this finding. However, the data of the present study show that 
Caucasian high SES children have significantly higher correlations between 
the VDI and either of the measures of mental ability than do the Negro low 
SES children. It is also worth noting here that the advantaged grouos obtain 
higher scores on the PPVT, which is a test of verbal ability, than they do 
on the Goodenough, with the reverse true for the disadvantaged children. Evident- 
ly, while superior verbal ability usually accompanies high scores on the VDI, 
the discrimination task seems to tap a type of ability which is not necessarily 

tied to conventional measures of intelligence. 

In establishing the usefulness of any instrument, data concerning validity 
and reliability are c** paramount importance. At the present stage of its de- 
velopment, the UCLA VDI has established considerable construct validity, as 
evidenced by the subtest reliability. It also has a high level of face validity, 
based on traditional, commonsense association of certain reading subskills with 
ability to respond appropriately to differences and likenesses in visual stimuli. 
A more critical type of validity relates to an instrument's ability to predict 
performance on a criterion measure. A study is now being planned to obtain a 
measure of predictive validity, using performance in beginning reading as one 
important criterion. 

With reference to reliability, several measures have been computed and 
reported in Table 5. The values obtained are sufficiently high to engender 
confidence in the instrument, especially considering the fact that the population 
tested is a comparatively homogeneous one. 
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To/sunmarize*, the use of the UCLA instrument has demonstrated its use- 

% 

fulness in identifying a range of ability in visual discriuri nation with 
several population groupings. Additional work is now being done to develop 
the four subtests as measures of component elements of a broad visual dis- 
crimination ability. 

Large pools of items are being administered over a broad population of 
children. The data will then be subjected to factor analysis to identify 
the most sensitive items and those which have minimum overlap. It is ex- 
pected that a strong and reliable measure of visual discrimination, which 
can be used with young children from a wide range of socioeconomic back- 
grounds, will be developed. It is hoped that this type of measure, given as 
a routine test at the kindergarten or first grade level, will identify chil- 
dren who may be expected to demonstrate a wide variety of problems in dealing 
with the environment. Of major importance will be the potential ability to 
predict cases of Incipient dyslexia, as well as uther learning disorders, be- 
fore they have become severe enough to be obvious to the classroom teacher. 
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Table 2 



Analysis of Variance (Race x Sex x Age) 



Source 


df 


MS 


F 


Race (A) 


1 


1253.3 


23.8** 


Sex (B) 


1 


9.0 


.17 


Age (C) 


2 


778.6 


14.8** 


A x B 


1 


0.31 


.01 


A x C 


2 


127.5 


2.4 


B x C 


2 


96.45 


1.8 


A x B x C 


2 


158.2 


3.0 


Error w 


279 


52.6 





** p <( .01 
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Table 3 

Analysis of Variance (Race x SES x Age) 



Source 


df 


MS 


F 


Race (A) 


1 


310.7 


5.8* 


SES (B) 


1 


179.9 


3.4 


Age (C) 


2 


484,3 


9. OS** 


A x B 


1 


0.5 


.01 


A x C 


2 


62.8 


1.2 


B x C 


2 


6.1 


.11 


A x B x C 


2 


21.31 


.40 


Error, , 


279 


53.3 













* p < .05 

** p ^ .01 











Table 4 

Correlations on UCLA Visual Discrimination Inventory with Goodenough Draw-a-Man Test 
and Peabody Picture Vocabulary Test by Race and SES 



Group 


N 


VDI 


DAM 


PPVT 


M 


SD 


M 


SD 


r 


M 


SD 


r 


Negro 


78 


27.7 


6,7 


48.2 


10.6 


.21 


41.7 


10.9 


.25 


Caucasian 


83 


33.2 


7.4 


51.5 


14.5 


.58 


59.6 


14.2 


.55 


High SES 


70 


33.4 


7.6 


51.3 


14.9 


.68 


61.4 


14.6 


.59 


Low SES 


91 


28.4 


6.9 


48.9 


11.0 


.19 


42.8 


10.7 


00 

CM 

• 




Table 5 

Total and Subtest Reliabilities Based on Population of 112 Children 
from Low Socioeconomic Group Only (Combined Race and Age) 



Test and Subtests by Forms 


N 


M 


SO 


K-R 20 1 


Total Test (52 Items) 










Form A 


36 


30.1 


8.8 


.90 


Form B 


38 


32.1 


9.2 


.92 


Form C 


38 


31.0 


8.4 


.90 


Flaure-Ground (13 Items) 










Form A 


36 


8.9 


3.4 


.88 


Form B 


38 


9.3 


2.3 


.71 


Form C 


38 


8.0 


3.0 


.85 


Form-Constancy (13 Items) 










Form A 


36 


7.5 


2.6 


.71 


Form B 


38 


8.0 


2.5 


.71 


Form C 


38 


7.6 


2.4 


.62 


Closure (13 Items) 










Form A 


36 


6.8 


2.3 


.69 


Form B 


38 


7.9 


2.8 


.82 


Form C 


38 


6.7 


2.4 


.77 


Pos1t1on-1n-Soace (13 Items) 










Form A 


36 


6.9 


2.4 


.64 


Form B 


38 


6.7 


2.9 


.85 


Form C 


38 


7.7 


2.9 


.86 



^Horst's revision. 



ABSTRACT 



Tests of visual discrimination which usually involve eye-hand 
coordination, produce scores which may confound motor skill with 
visual discrimination ability. To avoid this problem, a test which 
requires a selection rather than a drawing response has been developed 
at UCLA. There are 52 items, 13 in each of 4 categories: form-constancy, 
figure-ground, closure, and position-in-space. The test was administered 
to 291 children, Negro and Caucasian, in 3 age groups (3, 4, and 5) and 
2 levels of socioeconumic status. Two measures of internal consistency, 
Spearman -Brown r * .91 and Kuder-Richardson r = .88, Indicate an accept- 
able level of reliability. Analysis of variance showed significant main 
effects for age and race. Low positive correlations with mental age 
Indicate a minimal intelligence component. The test should prove valuable 
for assessing a skill usually associated with beginning reading. 
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